skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Tenenbaum, Joshua B"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. A "quine" is a deterministic program that prints itself. In this essay, I will show you a "gauguine": a probabilistic program that infers itself. A gauguine is repeatedly asked to guess its own source code. Initially, its chances of guessing correctly are of course minuscule. But as the gauguine observes more and more of its own previous guesses, it detects patterns of behavior and gains information about its inner workings. This information allows it to bootstrap self-knowledge, and ultimately discover its own source code. We will discuss how—and why—we might write a gauguine, and what we stand to learn by constructing one. 
    more » « less
    Free, publicly-accessible full text available October 9, 2026
  2. Free, publicly-accessible full text available July 1, 2026
  3. Free, publicly-accessible full text available May 1, 2026
  4. Adults can calculate probabilities by running simulations and calculating proportions of each outcome. How does this ability develop? We developed a method that lets us bring computational modeling to bear on this question. A study of 40 adults and 31 4-year-olds indicates that unlike adults, many 4-year-olds use a single simulation to estimate probability distributions over simulated possibilities. We also implemented the 3-cups task, an established test of children’s sensitivity to possibilities, in a novel format. We replicate existing 3-cups results. Moreover, children who our model categorized as running a single simulation on our novel task show a signature of running a single simulation in the 3-cups task. This signature is not observed in children who were categorized as running multiple simulations. This validates our model and adds to the evidence that about half of 4-year-olds don’t evaluate multiple candidates for reality in parallel. 
    more » « less
    Free, publicly-accessible full text available May 13, 2026
  5. Free, publicly-accessible full text available May 13, 2026
  6. Free, publicly-accessible full text available October 22, 2026
  7. Griffiths, Thomas L; Chater, Nick; Tenenbaum, Joshua T (Ed.)
  8. How do people perform general-purpose physical reasoning across a variety of scenarios in everyday life? Across two stud ies with seven different physical scenarios, we asked participants to predict whether or where two objects will make contact. People achieved high accuracy and were highly consistent with each other in their predictions. We hypothesize that this robust generalization is a consequence of mental simulations of noisy physics. We designed an “intuitive physics engine” model to capture this generalizable simulation. We find that this model generalized in human-like ways to unseen stimuli and to a different query of predictions. We evaluated several state-of-the-art deep learning and scene feature models on the same task and found that they could not explain human predictions as well. This study provides evidence that human’s robust generalization in physics predictions are supported by a probabilistic simulation model, and suggests the need for structure in learned dynamics models. 
    more » « less
  9. Effective planning in the real world requires not only world knowledge, but the ability to leverage that knowledge to build the right representation of the task at hand. Decades of hierarchical planning techniques have used domain-specific temporal action abstractions to support efficient and accurate planning, almost always relying on human priors and domain knowledge to decompose hard tasks into smaller subproblems appropriate for a goal or set of goals. This paper describes Ada (Action Domain Acquisition), a framework for automatically constructing task-specific planning representations using task-general background knowledge from language models (LMs). Starting with a general-purpose hierarchical planner and a low-level goal-conditioned policy, Ada interactively learns a library of planner-compatible high-level action abstractions and low-level controllers adapted to a particular domain of planning tasks. On two language-guided interactive planning benchmarks (Mini Minecraft and ALFRED Household Tasks), Ada strongly outperforms other approaches that use LMs for sequential decision- making, offering more accurate plans and better generalization to complex tasks. 
    more » « less